Similarity Measures over Refinement Graphs
ثبت نشده
چکیده
Similarity assessment plays a key role in lazy learning methods such as knearest neighbor or case-based reasoning. In this paper we will show how refinement graphs, that were originally introduced for inductive learning, can be employed to assess and reason about similarity. We will define and analyze two similarity measures, $S_{?}$ and $S_{?}$, based on refinement graphs. The \emph{anti-unification-based similarity}, $S_{?}$, assesses similarity by finding the anti-unification of two instances, which is a description capturing all the information common to these two instances. The \emph{property-based similarity}, $S_{?}$, is based on a process of disintegrating the instances into a set of {\em properties}, and then analyzing these property sets. Moreover these similarity measures are applicable to any representation language for which a refinement graph that satisfies the requirements we identify can be defined. Specifically, we present a refinement graph for feature terms, in which several languages of increasing expressiveness can be defined. The similarity measures are empirically evaluated on relational data sets belonging to languages of different expressiveness. Source URL: https://www.iiia.csic.es/en/node/53955 Links [1] https://www.iiia.csic.es/en/staff/santiago-onta%C3%B1%C3%B3n [2] https://www.iiia.csic.es/en/staff/enric-plaza [3] https://www.iiia.csic.es/en/bibliography?f[keyword]=433 [4] https://www.iiia.csic.es/en/bibliography?f[keyword]=580 [5] https://www.iiia.csic.es/en/bibliography?f[keyword]=659 [6] https://www.iiia.csic.es/en/bibliography?f[keyword]=450
منابع مشابه
Similarity Measures over Refinement Graphs
Similarity assessment plays a key role in lazy learning methods such as knearest neighbor or case-based reasoning. In this paper we will show how refinement graphs, that were originally introduced for inductive learning, can be employed to assess and reason about similarity. We will define and analyze two similarity measures, $S_{?}$ and $S_{?}$, based on refinement graphs. The \emph{anti-unifi...
متن کاملAn Refinement Proach for Large Graphs Approximate Matching
How to match two large graphs by maximizing the number of matched edges, which is known as maximum common subgraph matching and is NP-hard. We give heuristics to select a small number of important anchors using a new similarity score, which measures how two nodes in two different graphs are similar to be matched by taking both global and local information of nodes into consideration. And then t...
متن کاملComputing Simulations on Finite and Infinite Graphs
We present algorithms for computing similarity relations of labeled graphs. Similarity relations have applications for the refinement and verification of reactive systems. For finite graphs, we present an O(mn) algorithm for computing the similarity relation of a graph with n vertices and m edges (assuming m 2 n). For effectively presented infinite graphs, we present a symbolic similarity-check...
متن کاملSimilarity Search in Structured Data
Recently, structured data is getting more and more important in database applications, such as molecular biology, image retrieval or XML document retrieval. Attributed graphs are a natural model for the structured data in those applications. For the clustering and classification of such structured data, a similarity measure for attributed graphs is necessary. All known similarity measures for a...
متن کاملRHOG: A Refinement-Operator Library for Directed Labeled Graphs
This document provides the foundations behind the functionality provided by the ρG library 1 , focusing on the basic operations the library provides: subsumption, refinement of directed labeled graphs, and distance/similarity assessment between directed labeled graphs.
متن کامل